SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Wang, Elizabeth A. 

Rosen, Vicki A.
Wozney, John M. 

(ii) TITLE OF INVENTION: Novel BMP Products 

(iii) NUMBER OF SEQUENCES: 6 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: LEGAL AFFAIRS, GENETICS INSTITUTE, INC. 

(B) STREET: 87 CAMBRIDGEPARK DRIVE 

(C) CITY: CAMBRIDGE 

(D) STATE: MA 

(E) COUNTRY: USA 

(F) ZIP: 02140 

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25

(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER: US

(B) FILING DATE:

(C) CLASSIFICATION:

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: KAPINOS, ELLEN J.

(B) REGISTRATION NUMBER: 32,245

(C) REFERENCE/DOCKET NUMBER: GI5160C

(ix) TELECOMMUNICATION INFORMATION:

(A) TELEPHONE: 617-876-1170

(B) TELEFAX: 617-876-5851 



(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 592 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: BOS TAURUS 

(vii) IMMEDIATE SOURCE: 




(A) LIBRARY :^^TRE GENOMIC IN LAMBDA J 

(B) CLONE: LAMBDA BP-21

(viii) POSITION IN GENOME:

(C) UNITS: bp

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..390

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 46..387



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

CAC GAT GGG AAA GGA CAC CCT CTC CAC AGA AGA GAA AAG CGG CAA 48
His Asp Gly Lys Gly His Pro Leu His Arg Arg Glu Lys Arg Gln
-15 -10 -5 1

GCA AAA CAC AAA CAG CGG AAA CGC CTC AAG TCC AGC TGT AAG AGA CAC 96
Ala Lys His Lys Gln Arg Lys Arg Leu Lys Ser Ser Cys Lys Arg His
5 10 15

CCT TTA TAT GTG GAC TTC AGT GAT GTG GGG TGG AAT GAC TGG ATC GTT 144
Pro Leu Tyr Val Asp Phe Ser Asp Val Gly Trp Asn Asp Trp Ile Val
20 25 30

GCA CCG CCG GGG TAT CAT GCC TTT TAC TGC CAT GGG GAG TGC CCT TTT 192
Ala Pro Pro Gly Tyr His Ala Phe Tyr Cys His Gly Glu Cys Pro Phe
35 40 45

CCC CTG GCC GAT CAC CTT AAC TCC ACG AAT CAT GCC ATT CTC CAA ACT 240
Pro Leu Ala Asp His Leu Asn Ser Thr Asn His Ala Ile Leu Gln Thr
50 55 60 65

3TG GTC AAC TCA GTT AAC TCT AAG ATT CCC AAG GCA TGC TGT GTC CCA 288
Leu Val Asn Ser Val Asn Ser Lys Ile Pro Lys Ala Cys Cys Val Pro
70 75 80

ACA GAG CTC AGC GCC ATC TCC ATG CTG TAC CTT GAT GAG AAT GAG AAG 336
Thr Glu Leu Ser Ala Ile Ser Met Leu Tyr Leu Asp Glu Asn Glu Lys
85 90 95

GTG GTA TTA AAG AAC TAT CAG GAC ATG GTT GTC GAG GGT TGT GGG TGT 384
Val Val Leu Lys Asn Tyr Gln Asp Met Val Val Glu Gly Cys Gly Cys
100 105 110

CGT TAGCACAGCA AAATAAAATA TAAATATATA TATATATATA TTAGAAAAAC 437
Arg
115

AGCAAAAAAA TCAAGTTGAC ACTTTAATAT TTCCCAATGA AGACTTTATT TATGGAATGG 497

AATGGAGAAA AAGAAAAACA CAGCTATTTT GAAAACTATA TTTATATCTA CCGAAAAGAA 557

3TTGGGAAAA CAAATATTTT AATCAGAGAA TTATT 592
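The codon rows and residue rows of SEQ ID NO:1 can be cross-checked mechanically. The sketch below is an illustration only and is not part of the listing as filed. It assumes the standard genetic code, and the helper name `translate_codons` is hypothetical. It translates the first codon row of SEQ ID NO:1 so the result can be compared with the residues printed beneath that row.

```python
# Illustrative sketch (not part of the filed listing): translate a row of
# codons with the standard genetic code and compare with the printed residues.
from itertools import product

BASES = "TCAG"
# One-letter residues for all 64 codons in TCAG order; '*' marks a stop codon.
ONE_LETTER = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "***",
}
# Map each codon (TTT, TTC, TTA, ...) to its three-letter residue code.
CODON_TABLE = {
    "".join(codon): THREE_LETTER[aa]
    for codon, aa in zip(product(BASES, repeat=3), ONE_LETTER)
}


def translate_codons(row):
    """Translate a whitespace-separated row of DNA codons (hypothetical helper)."""
    return [CODON_TABLE[codon.upper()] for codon in row.split()]


# First codon row of SEQ ID NO:1 as printed in the listing.
row = "CAC GAT GGG AAA GGA CAC CCT CTC CAC AGA AGA GAA AAG CGG CAA"
print(" ".join(translate_codons(row)))
```

A row whose translation disagrees with the printed residues points to a transcription error in either the codons or the residue names.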




(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 129 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

Gly His Asp Gly Lys Gly His Pro Leu His Arg Arg Glu Lys Arg Gln
-15 -10 -5 1

Ala Lys His Lys Gln Arg Lys Arg Leu Lys Ser Ser Cys Lys Arg His
5 10 15

Pro Leu Tyr Val Asp Phe Ser Asp Val Gly Trp Asn Asp Trp Ile Val
20 25 30

Ala Pro Pro Gly Tyr His Ala Phe Tyr Cys His Gly Glu Cys Pro Phe
35 40 45

Pro Leu Ala Asp His Leu Asn Ser Thr Asn His Ala Ile Leu Gln Thr
50 55 60 65

Leu Val Asn Ser Val Asn Ser Lys Ile Pro Lys Ala Cys Cys Val Pro
70 75 80

Thr Glu Leu Ser Ala Ile Ser Met Leu Tyr Leu Asp Glu Asn Glu Lys
85 90 95

Val Val Leu Lys Asn Tyr Gln Asp Met Val Val Glu Gly Cys Gly Cys
100 105 110



(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1607 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens

(G) CELL TYPE: Osteosarcoma Cell Line

(H) CELL LINE: U-2OS

(vii) IMMEDIATE SOURCE:

(A) LIBRARY: U2OS cDNA in Lambda GT10




(B) CLONE: Lambda U2OS-39

(viii) POSITION IN GENOME: 

(C) UNITS: bp 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 356..1546

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 1202..1543

(ix) FEATURE:

(A) NAME/KEY: mRNA

(B) LOCATION: 14..1607

(ix) FEATURE:

(A) NAME/KEY: sig_peptide

(B) LOCATION: 356..424
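The feature coordinates for SEQ ID NO: 3 can be checked against the lengths and residue numbering stated elsewhere in this listing. The following sketch is an illustration, not part of the listing as filed. It assumes that each location is an inclusive `start..end` range in base pairs and that the CDS range includes the stop codon. The helper name `feature_span` is hypothetical.

```python
# Illustrative sketch (not part of the filed listing): derive residue counts
# from the feature locations given for SEQ ID NO: 3.

def feature_span(location):
    """Return (start, end, length_bp) for an inclusive 'start..end' location."""
    start_text, end_text = location.replace(" ", "").split("..")
    start, end = int(start_text), int(end_text)
    return start, end, end - start + 1


cds_start, cds_end, cds_bp = feature_span("356..1546")
mat_start, mat_end, mat_bp = feature_span("1202..1543")
sig_start, sig_end, sig_bp = feature_span("356..424")

# Assumption: the CDS range ends with the stop codon, so the protein is one
# codon shorter than the CDS.
protein_residues = cds_bp // 3 - 1
mature_residues = mat_bp // 3
signal_residues = sig_bp // 3
# Residues that precede the mature peptide (signal peptide plus propeptide).
leader_residues = (mat_start - cds_start) // 3

print(protein_residues, mature_residues, signal_residues, leader_residues)
```

Under these assumptions the sketch gives 396 residues, which agrees with the length stated for SEQ ID NO: 4, and 282 leader residues, which agrees with the -282 numbering of the initiating Met.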



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

GTCGACTCTA GAGTGTGTGT CAGCACTTGG CTGGGGACTT CTTGAACTTG CAGGGAGAAT 60

AAlTTGCGCA CCCCACTTTG CGCCGGTGCC TTTGCCCCAG CGGAGCCTGC TTCGCCATCT 120 

CG^AGCCCCA CCGCCCCTCC ACTCCTCGGC CTTGCCCGAC ACTGAGACGC TGTTCCCAGC 180 

G^BaAAAGAG AGACTGCGCG GCCGGCACCC GGGAGAAGGA GGAGGCAAAG AAAAGGAACG 240 

Gft^ATTCGGT CCTTGCGCCA GGTCCTTTGA CCAGAGTTTT TCCATGTGGA CGCTCTTTCA 300 

ATGGACGTGT CCCCGCGTGC TTCTTAGACG GACTGCGGTC TCCTAAAGGT CGACC ATG 358 
Met

-282

GTG GCC GGG ACC CGC TGT CTT CTA GCG TTG CTG CTT CCC CAG GTC CTC 406 
Val Ala Gly Thr Arg Cys Leu Leu Ala Leu Leu Leu Pro Gln Val Leu
-280 -275 -270 

CTG GGC GGC GCG GCT GGC CTC GTT CCG GAG CTG GGC CGC AGG AAG TTC 454 
Leu Gly Gly Ala Ala Gly Leu Val Pro Glu Leu Gly Arg Arg Lys Phe 
-265 -260 -255 -250 

GCG GCG GCG TCG TCG GGC CGC CCC TCA TCC CAG CCC TCT GAC GAG GTC 502 
Ala Ala Ala Ser Ser Gly Arg Pro Ser Ser Gln Pro Ser Asp Glu Val

-245 -240 -235 

CTG AGC GAG TTC GAG TTG CGG CTG CTC AGC ATG TTC GGC CTG AAA CAG 550 
Leu Ser Glu Phe Glu Leu Arg Leu Leu Ser Met Phe Gly Leu Lys Gln
-230 -225 -220 

AGA CCC ACC CCC AGC AGG GAC GCC GTG GTG CCC CCC TAC ATG CTA GAC 598 
Arg Pro Thr Pro Ser Arg Asp Ala Val Val Pro Pro Tyr Met Leu Asp 
-215 -210 -205 



CTG TAT CGC AGG CAC TCA GGT CAG CCG GGC TCA CCC GCC CCA GAC CAC 646
Leu Tyr Arg Arg His Ser Gly Gln Pro Gly Ser Pro Ala Pro Asp His
-200 -195 -190

2GG TTG GAG AGG GCA GCC AGC CGA GCC AAC ACT GTG CGC AGC TTC CAC 694 
Arg Leu Glu Arg Ala Ala Ser Arg Ala Asn Thr Val Arg Ser Phe His
-185 -180 -175 -170 

CAT GAA GAA TCT TTG GAA GAA CTA CCA GAA ACG AGT GGG AAA ACA ACC 742
His Glu Glu Ser Leu Glu Glu Leu Pro Glu Thr Ser Gly Lys Thr Thr
-165 -160 -155

-GG AGA TTC TTC TTT AAT TTA AGT TCT ATC CCC ACG GAG GAG TTT ATC 790 
Arg Arg Phe Phe Phe Asn Leu Ser Ser Ile Pro Thr Glu Glu Phe Ile
-150 -145 -140 

ACC TCA GCA GAG CTT CAG GTT TTC CGA GAA CAG ATG CAA GAT GCT TTA 838
Thr Ser Ala Glu Leu Gln Val Phe Arg Glu Gln Met Gln Asp Ala Leu
-135 -130 -125 



GGA AAC AAT AGC AGT TTC CAT CAC CGA ATT AAT ATT TAT GAA ATC ATA 886
Gly Asn Asn Ser Ser Phe His His Arg Ile Asn Ile Tyr Glu Ile Ile
-120 -115 -110 

*A2P CCT GCA ACA GCC AAC TCG AAA TTC CCC GTG ACC AGA CTT TTG GAC 934 
Lys Pro Ala Thr Ala Asn Ser Lys Phe Pro Val Thr Arg Leu Leu Asp
-105 -100 -95 -90

ACl AGG TTG GTG AAT CAG AAT GCA AGC AGG TGG GAA AGT TTT GAT GTC 982 
Thr Arg Leu Val Asn Gln Asn Ala Ser Arg Trp Glu Ser Phe Asp Val
-85 -80 -75

ACti' CCC GCT GTG ATG CGG TGG ACT GCA CAG GGA CAC GCC AAC CAT GGA 1030 
Thr Pro Ala Val Met Arg Trp Thr Ala Gln Gly His Ala Asn His Gly
-70 -65 -60

rT<§ GTG GTG GAA GTG GCC CAC TTG GAG GAG AAA CAA GGT GTC TCC AAG 1078 
Phe Val Val Glu Val Ala His Leu Glu Glu Lys Gln Gly Val Ser Lys
-55 -50 -45

AGA CAT GTT AGG ATA AGC AGG TCT TTG CAC CAA GAT GAA CAC AGC TGG 1126 
Arg His Val Arg Ile Ser Arg Ser Leu His Gln Asp Glu His Ser Trp
-40 -35 -30 

TCA CAG ATA AGG CCA TTG CTA GTA ACT TTT GGC CAT GAT GGA AAA GGG 1174 
Ser Gln Ile Arg Pro Leu Leu Val Thr Phe Gly His Asp Gly Lys Gly
-25 -20 -15 -10 

CAT CCT CTC CAC AAA AGA GAA AAA CGT CAA GCC AAA CAC AAA CAG CGG 1222 
His Pro Leu His Lys Arg Glu Lys Arg Gln Ala Lys His Lys Gln Arg

-5 1 5 

AAA CGC CTT AAG TCC AGC TGT AAG AGA CAC CCT TTG TAC GTG GAC TTC 1270 
Lys Arg Leu Lys Ser Ser Cys Lys Arg His Pro Leu Tyr Val Asp Phe 
10 15 20 

AGT GAC GTG GGG TGG AAT GAC TGG ATT GTG GCT CCC CCG GGG TAT CAC 1318 
Ser Asp Val Gly Trp Asn Asp Trp Ile Val Ala Pro Pro Gly Tyr His
25 30 35 

GCC TTT TAC TGC CAC GGA GAA TGC CCT TTT CCT CTG GCT GAT CAT CTG 1366
Ala Phe Tyr Cys His Gly Glu Cys Pro Phe Pro Leu Ala Asp His Leu
40 45 50 55

AAC TCC ACT AAT CAT GCC ATT GTT CAG ACG TTG GTC AAC TCT GTT AAC 1414
Asn Ser Thr Asn His Ala Ile Val Gln Thr Leu Val Asn Ser Val Asn

60 65 70 

TCT AAG ATT CCT AAG GCA TGC TGT GTC CCG ACA GAA CTC AGT GCT ATC 1462
Ser Lys Ile Pro Lys Ala Cys Cys Val Pro Thr Glu Leu Ser Ala Ile
75 80 85

TCG ATG CTG TAC CTT GAC GAG AAT GAA AAG GTT GTA TTA AAG AAC TAT 1510
Ser Met Leu Tyr Leu Asp Glu Asn Glu Lys Val Val Leu Lys Asn Tyr
90 95 100 

CAG GAC ATG GTT GTG GAG GGT TGT GGG TGT CGC TAGTACAGCA AAATTAAATA 1563
Gln Asp Met Val Val Glu Gly Cys Gly Cys Arg

105 110 115 

IATAAATATA TATATATATA TATATTTTAG AAAAAAGAAA AAAA 1607 

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 396 amino acids
(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Met Val Ala Gly Thr Arg Cys Leu Leu Ala Leu Leu Leu Pro Gln Val
-282 -280 -275 -270

Leu Leu Gly Gly Ala Ala Gly Leu Val Pro Glu Leu Gly Arg Arg Lys
-265 -260 -255

Phe Ala Ala Ala Ser Ser Gly Arg Pro Ser Ser Gln Pro Ser Asp Glu
-250 -245 -240 -235

Val Leu Ser Glu Phe Glu Leu Arg Leu Leu Ser Met Phe Gly Leu Lys
-230 -225 -220

Gln Arg Pro Thr Pro Ser Arg Asp Ala Val Val Pro Pro Tyr Met Leu
-215 -210 -205

Asp Leu Tyr Arg Arg His Ser Gly Gln Pro Gly Ser Pro Ala Pro Asp
-200 -195 -190

His Arg Leu Glu Arg Ala Ala Ser Arg Ala Asn Thr Val Arg Ser Phe
-185 -180 -175

His His Glu Glu Ser Leu Glu Glu Leu Pro Glu Thr Ser Gly Lys Thr
-170 -165 -160 -155

Thr Arg Arg Phe Phe Phe Asn Leu Ser Ser Ile Pro Thr Glu Glu Phe
-150 -145 -140

Ile Thr Ser Ala Glu Leu Gln Val Phe Arg Glu Gln Met Gln Asp Ala
-135 -130 -125

Leu Gly Asn Asn Ser Ser Phe His His Arg Ile Asn Ile Tyr Glu Ile
-120 -115 -110

Ile Lys Pro Ala Thr Ala Asn Ser Lys Phe Pro Val Thr Arg Leu Leu
-105 -100 -95

Asp Thr Arg Leu Val Asn Gln Asn Ala Ser Arg Trp Glu Ser Phe Asp
-90 -85 -80 -75

Val Thr Pro Ala Val Met Arg Trp Thr Ala Gln Gly His Ala Asn His
-70 -65 -60

Gly Phe Val Val Glu Val Ala His Leu Glu Glu Lys Gln Gly Val Ser
-55 -50 -45

Lys Arg His Val Arg Ile Ser Arg Ser Leu His Gln Asp Glu His Ser
-40 -35 -30

Trp Ser Gln Ile Arg Pro Leu Leu Val Thr Phe Gly His Asp Gly Lys
-25 -20 -15

Gly His Pro Leu His Lys Arg Glu Lys Arg Gln Ala Lys His Lys Gln
-10 -5 1 5

Arg Lys Arg Leu Lys Ser Ser Cys Lys Arg His Pro Leu Tyr Val Asp
10 15 20

Phe Ser Asp Val Gly Trp Asn Asp Trp Ile Val Ala Pro Pro Gly Tyr
25 30 35

His Ala Phe Tyr Cys His Gly Glu Cys Pro Phe Pro Leu Ala Asp His
40 45 50

Leu Asn Ser Thr Asn His Ala Ile Val Gln Thr Leu Val Asn Ser Val
55 60 65 70

Asn Ser Lys Ile Pro Lys Ala Cys Cys Val Pro Thr Glu Leu Ser Ala
75 80 85

Ile Ser Met Leu Tyr Leu Asp Glu Asn Glu Lys Val Val Leu Lys Asn
90 95 100

Tyr Gln Asp Met Val Val Glu Gly Cys Gly Cys Arg
105 110

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1954 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA
(iii) HYPOTHETICAL: NO 




(iv) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE:

(A) ORGANISM: Homo sapiens

(G) CELL TYPE: Osteosarcoma Cell Line

(H) CELL LINE: U-2OS

(vii) IMMEDIATE SOURCE:

(A) LIBRARY: U2OS cDNA in Lambda gt10

(B) CLONE: Lambda U2OS-3

(viii) POSITION IN GENOME: 

(C) UNITS: bp 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 403..1629

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 1279..1626

(ix) FEATURE:

(A) NAME/KEY: mRNA

(B) LOCATION: 9..1934

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:








CTCTAGAGGG CAGAGGAGGA GGGAGGGAGG GAAGGAGCGC GGAGCCCGGC CCGGAAGCTA 60

GGTGAGTGTG GCATCCGAGC TGAGGGACGC GAGCCTGAGA CGCCGCTGCT GCTCCGGCTG 120

AGTATCTAGC TTGTCTCCCC GATGGGATTC CCGTCCAAGC TATCTCGAGC CTGCAGCGCC 180

&c|gtccccg GCCCTCGCCC AGGTTCACTG CAACCGTTCA GAGGTCCCCA GGAGCTGCTG 240

CTflCGAGCC CGCTACTGCA GGGACCTATG GAGCCATTCC GTAGTGCCAT CCCGAGCAAC 300

3CACTGCTGC AGCTTCCCTG AGCCTTTCCA GCAAGTTTGT TCAAGATTGG CTGTCAAGAA 360

TCATGGACTG TTATTATATG CCTTGTTTTC TGTCAAGACA CC ATG ATT CCT GGT 414
Met Ile Pro Gly
-292 -290



AAC CGA ATG CTG ATG GTC GTT TTA TTA TGC CAA GTC CTG CTA GGA GGC 462 
Asn Arg Met Leu Met Val Val Leu Leu Cys Gln Val Leu Leu Gly Gly
-285 -280 -275 

GCG AGC CAT GCT AGT TTG ATA CCT GAG ACG GGG AAG AAA AAA GTC GCC 510 
Ala Ser His Ala Ser Leu Ile Pro Glu Thr Gly Lys Lys Lys Val Ala
-270 -265 -260 

GAG ATT CAG GGC CAC GCG GGA GGA CGC CGC TCA GGG CAG AGC CAT GAG 558 
Glu Ile Gln Gly His Ala Gly Gly Arg Arg Ser Gly Gln Ser His Glu
-255 -250 -245 

CTC CTG CGG GAC TTC GAG GCG ACA CTT CTG CAG ATG TTT GGG CTG CGC 606 
Leu Leu Arg Asp Phe Glu Ala Thr Leu Leu Gln Met Phe Gly Leu Arg
-240 -235 -230 -225



CGC CGC CCG CAG CCT AGC AAG AGT GCC GTC ATT CCG GAC TAC ATG CGG 654
Arg Arg Pro Gln Pro Ser Lys Ser Ala Val Ile Pro Asp Tyr Met Arg

-220 -215 -210 

GAT CTT TAC CGG CTT CAG TCT GGG GAG GAG GAG GAA GAG CAG ATC CAC 702
Asp Leu Tyr Arg Leu Gln Ser Gly Glu Glu Glu Glu Glu Gln Ile His
-205 -200 -195 

AGC ACT GGT CTT GAG TAT CCT GAG CGC CCG GCC AGC CGG GCC AAC ACC 750 
Ser Thr Gly Leu Glu Tyr Pro Glu Arg Pro Ala Ser Arg Ala Asn Thr 
-190 -185 -180 

GTG AGG AGC TTC CAC CAC GAA GAA CAT CTG GAG AAC ATC CCA GGG ACC 798
Val Arg Ser Phe His His Glu Glu His Leu Glu Asn Ile Pro Gly Thr
-175 -170 -165 

AGT GAA AAC TCT GCT TTT CGT TTC CTC TTT AAC CTC AGC AGC ATC CCT 846
Ser Glu Asn Ser Ala Phe Arg Phe Leu Phe Asn Leu Ser Ser Ile Pro
-160 -155 -150 -145

GAG AAC GAG GTG ATC TCC TCT GCA GAG CTT CGG CTC TTC CGG GAG CAG 894
Glu Asn Glu Val Ile Ser Ser Ala Glu Leu Arg Leu Phe Arg Glu Gln
-140 -135 -130

GTG GAC CAG GGC CCT GAT TGG GAA AGG GGC TTC CAC CGT ATA AAC ATT 942
Val Asp Gln Gly Pro Asp Trp Glu Arg Gly Phe His Arg Ile Asn Ile
-125 -120 -115

TAfUGAG GTT ATG AAG CCC CCA GCA GAA GTG GTG CCT GGG CAC CTC ATC 990 
Tyr Glu Val Met Lys Pro Pro Ala Glu Val Val Pro Gly His Leu Ile
-110 -105 -100

^C%jCGA CTA CTG GAC ACG AGA CTG GTC CAC CAC AAT GTG ACA CGG TGG 1038 
Thr Arg Leu Leu Asp Thr Arg Leu Val His His Asn Val Thr Arg Trp
-95 -90 -85

3A fc ACT TTT GAT GTG AGC CCT GCG GTC CTT CGC TGG ACC CGG GAG AAG 1086 
Glu Thr Phe Asp Val Ser Pro Ala Val Leu Arg Trp Thr Arg Glu Lys
-80 -75 -70 -65

CAG CCA AAC TAT GGG CTA GCC ATT GAG GTG ACT CAC CTC CAT CAG ACT 1134 
Gln Pro Asn Tyr Gly Leu Ala Ile Glu Val Thr His Leu His Gln Thr

-60 -55 -50 

CGG ACC CAC CAG GGC CAG CAT GTC AGG ATT AGC CGA TCG TTA CCT CAA 1182 
Arg Thr His Gln Gly Gln His Val Arg Ile Ser Arg Ser Leu Pro Gln
-45 -40 -35 

GGG AGT GGG AAT TGG GCC CAG CTC CGG CCC CTC CTG GTC ACC TTT GGC 1230
Gly Ser Gly Asn Trp Ala Gln Leu Arg Pro Leu Leu Val Thr Phe Gly
-30 -25 -20

CAT GAT GGC CGG GGC CAT GCC TTG ACC CGA CGC CGG AGG GCC AAG CGT 1278 
His Asp Gly Arg Gly His Ala Leu Thr Arg Arg Arg Arg Ala Lys Arg 
-15 -10 -5 

AGC CCT AAG CAT CAC TCA CAG CGG GCC AGG AAG AAG AAT AAG AAC TGC 1326
Ser Pro Lys His His Ser Gln Arg Ala Arg Lys Lys Asn Lys Asn Cys
1 5 10 15



CGG CGC CAC TCG CTC TAT GTG GAC TTC AGC GAT GTG GGC TGG AAT GAC 1374
Arg Arg His Ser Leu Tyr Val Asp Phe Ser Asp Val Gly Trp Asn Asp
20 25 30

TGG ATT GTG GCC CCA CCA GGC TAC CAG GCC TTC TAC TGC CAT GGG GAC 1422
Trp Ile Val Ala Pro Pro Gly Tyr Gln Ala Phe Tyr Cys His Gly Asp
35 40 45

TGC CCC TTT CCA CTG GCT GAC CAC CTC AAC TCA ACC AAC CAT GCC ATT 1470
Cys Pro Phe Pro Leu Ala Asp His Leu Asn Ser Thr Asn His Ala Ile
50 55 60

GTG CAG ACC CTG GTC AAT TCT GTC AAT TCC AGT ATC CCC AAA GCC TGT 1518
Val Gln Thr Leu Val Asn Ser Val Asn Ser Ser Ile Pro Lys Ala Cys
65 70 75 80

TGT GTG CCC ACT GAA CTG AGT GCC ATC TCC ATG CTG TAC CTG GAT GAG 1566
Cys Val Pro Thr Glu Leu Ser Ala Ile Ser Met Leu Tyr Leu Asp Glu
85 90 95

TAT GAT AAG GTG GTA CTG AAA AAT TAT CAG GAG ATG GTA GTA GAG GGA 1614
Tyr Asp Lys Val Val Leu Lys Asn Tyr Gln Glu Met Val Val Glu Gly
100 105 110

TGT GGG TGC CGC TGAGATCAGG CAGTCCTTGA GGATAGACAG ATATACACAC 1666
Cys Gly Cys Arg
115



C^ACACACA CACCACATAC ACCACACACA CACGTTCCCA TCCACTCACC CACACACTAC 1726 

A|AGACTGCT TCCTTATAGC TGGACTTTTA TTTAAAAAAA AAAAAAAAAA AATGGAAAAA 1786 

ATCCCTAAAC ATTCACCTTG ACCTTATTTA TGACTTTACG TGCAAATGTT TTGACCATAT 1846 

TGATCATATA TTTTGACAAA ATATATTTAT AACTACGTAT TAAAAGAAAA AAATAAAATG 1906 

AGTCATTATT TTAAAAAAAA AAAAAAAACT CTAGAGTCGA CGGAATTC 1954 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 408 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Ile Pro Gly Asn Arg Met Leu Met Val Val Leu Leu Cys Gln Val
-292 -290 -285 -280

Leu Leu Gly Gly Ala Ser His Ala Ser Leu Ile Pro Glu Thr Gly Lys
-275 -270 -265

Lys Lys Val Ala Glu Ile Gln Gly His Ala Gly Gly Arg Arg Ser Gly
-260 -255 -250 -245

Gln Ser His Glu Leu Leu Arg Asp Phe Glu Ala Thr Leu Leu Gln Met
-240 -235 -230

Phe Gly Leu Arg Arg Arg Pro Gln Pro Ser Lys Ser Ala Val Ile Pro
-225 -220 -215

Asp Tyr Met Arg Asp Leu Tyr Arg Leu Gln Ser Gly Glu Glu Glu Glu
-210 -205 -200

Glu Gln Ile His Ser Thr Gly Leu Glu Tyr Pro Glu Arg Pro Ala Ser
-195 -190 -185

Arg Ala Asn Thr Val Arg Ser Phe His His Glu Glu His Leu Glu Asn
-180 -175 -170 -165

Ile Pro Gly Thr Ser Glu Asn Ser Ala Phe Arg Phe Leu Phe Asn Leu
-160 -155 -150

Ser Ser Ile Pro Glu Asn Glu Val Ile Ser Ser Ala Glu Leu Arg Leu
-145 -140 -135

Phe Arg Glu Gln Val Asp Gln Gly Pro Asp Trp Glu Arg Gly Phe His
-130 -125 -120

Arg Ile Asn Ile Tyr Glu Val Met Lys Pro Pro Ala Glu Val Val Pro
-115 -110 -105

Gly His Leu Ile Thr Arg Leu Leu Asp Thr Arg Leu Val His His Asn
-100 -95 -90 -85

Val Thr Arg Trp Glu Thr Phe Asp Val Ser Pro Ala Val Leu Arg Trp
-80 -75 -70

Thr Arg Glu Lys Gln Pro Asn Tyr Gly Leu Ala Ile Glu Val Thr His
-65 -60 -55

Leu His Gln Thr Arg Thr His Gln Gly Gln His Val Arg Ile Ser Arg
-50 -45 -40

Ser Leu Pro Gln Gly Ser Gly Asn Trp Ala Gln Leu Arg Pro Leu Leu
-35 -30 -25

Val Thr Phe Gly His Asp Gly Arg Gly His Ala Leu Thr Arg Arg Arg
-20 -15 -10 -5

Arg Ala Lys Arg Ser Pro Lys His His Ser Gln Arg Ala Arg Lys Lys
1 5 10

Asn Lys Asn Cys Arg Arg His Ser Leu Tyr Val Asp Phe Ser Asp Val
15 20 25

Gly Trp Asn Asp Trp Ile Val Ala Pro Pro Gly Tyr Gln Ala Phe Tyr
30 35 40

Cys His Gly Asp Cys Pro Phe Pro Leu Ala Asp His Leu Asn Ser Thr
45 50 55 60

Asn His Ala Ile Val Gln Thr Leu Val Asn Ser Val Asn Ser Ser Ile
65 70 75

Pro Lys Ala Cys Cys Val Pro Thr Glu Leu Ser Ala Ile Ser Met Leu
80 85 90

Tyr Leu Asp Glu Tyr Asp Lys Val Val Leu Lys Asn Tyr Gln Glu Met
95 100 105

Val Val Glu Gly Cys Gly Cys Arg
110 115



